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BAHL DECODING 



FIELD OF THE INVENTION 



The present invention relates to decoding and in particular to methods for implementing 
Bahl decoding. 



Messages transmitted over a wireless medium are encoded in order to reduce the 
chances that noise will corrupt the message. The encoding translates an original block of an 
original message which comprises a plurality of bits which are either '0' or T to an encoded 
block which is formed of a plurality of symbols which are either , 0 f or T. The encoding of the 
block usually includes adding redundancy (i.e., making the encoded block longer than the 
original block), introducing symbol dependency (i.e., making the value of each symbol in the 
encoded block a function of a plurality of bits in the original block) and interleaving the block 
(i.e., mixing the order of the symbols in the encoded block). In modern codes the size of the 
original block is usually very large, of the order of hundreds, thousands or tens of thousands of 
bits. Thus, the encoding turns a block of binary bits into a block of binary symbols. It is noted 
that there is not necessarily a direct correlation between the value of a single bit and the value 
of any one of the encoded symbols. However, the number of bits in the original block and the 
number of symbol groups in the encoded block are generally the same. Generally in 
convolutional codes, bits and symbol groups in corresponding locations within the block are 
related in that they form pairs of input and output of encoders and decoders, as described 
hereinbelow. 

In convolutional codes, an encoder which has a fixed number of possible states is used 
to encode the original block. Each bit of the original block which enters the encoder changes a 
current state of the encoder to a next state according to the value of the entering bit. It is noted 
that for convolutional codes, from each state the encoder can move only to two other states 
based on whether the input is '0' or T. For each entering bit the encoder generates a group of 
one or more symbols forming the transmitted message. In other trellis codes, larger inputs are 
received by the encoder in each iteration, and the number of possible state changes are larger 
accordingly. In decoding, the decoder assigns each state a state metric (SM), which represents 
the conditional probability that the encoder was in that state. 

During transmission of the message, noise is added to the transmitted encoded message, 
and a decoder is used to extract the original message from the received message, which 
includes the noise. Normally, each symbol in the received message is represented in the 
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receiver, before decoding, by a word which represents the chance that the symbol is a logical 
T. A high positive value of the word means that the symbol is a logical T with high 
confidence, while a negative value with a high absolute value means the symbol is a logical '0* 
with high confidence. A word with a zero value means that the symbol is a '0 1 or T with equal 
probability. This representation is referred to herein as soft data. 

Some decoding methods include one or more iterations of a step of an A-Posteriori 
probability estimation in which the soft data representation of each symbol is adjusted and/or 
represented by a hard value (i.e., either , 0' or T) based on the values of other symbols in the 
block of the adjusted symbol. 

Many methods for performing the adjustment step are based on the Bahl algorithm 
which is described, for example, in Steven S. Pietrobon, "Implementation and Performance of a 
Turbo/MAP Decoder", Int. J. Satellite Commun., vol. 16, pp. 23-46, Jan.-Feb. 1998, the 
disclosure of which is incorporated herein by reference. In the Bahl algorithm, for each 
received symbol group, two vectors of state metrics are prepared: a forward vector which 
represents the conditional probability of the encoder being in each of the possible states 
immediately before the symbol group was generated and a backward vector which represents 
the conditional probability of the encoder being in each of the possible states immediately after 
the symbol group was generated. The forward vector is calculated based on the received 
symbols which precede the adjusted symbol group in the block and the backward vector is 
calculated based on the received symbols which follow the adjusted symbol group in the block. 
The forward and backward vectors are recursively dependent on the previous state metric 
vector in the respective directions from the end points of the block. 

The forward and backward vectors of each symbol group are supplied concurrently to a 
log likelihood ratio (LLR) unit which provides an adjusted output value of the symbol group 
based on the forward and backward vectors and the original value of the symbol group. The 
output value is chosen according to the most likely transitions between the state metrics of the 
forward and backward vectors. 

After the vectors are supplied to the LLR unit, the vectors may be discarded unless they 
are needed for calculating vectors of other symbol groups. Therefore, it is preferred, in order to 
save storage, that the vectors be calculated immediately before they are used. However, due to 
the recursive nature of the vectors which are calculated from opposite sides of the block, such 
immediate calculation, usage and discarding is performed in the prior art only in one direction, 
e.g., in the forward direction. 
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A conventional prior art method for calculating the output values of all the symbols in a 
block comprises two passes over the block. In a first pass over the block, performed before the 
LLR unit begins the adjusting, the backward recursion vectors for all the symbols in the block 
are calculated and stored. In a second pass over the block, beginning from the opposite end of 
the block, the forward recursion vectors are calculated, the respective backward recursion 
vectors are retrieved from the storage and the forward and backward vectors are supplied to the 
LLR unit. The supplied vectors are immediately used by the LLR unit and then they are 
discarded. 

The storage requirements in this method, required in order to store the backward 
recursion vectors, are quite large. In order to reduce the required storage space, it has been 
suggested to perform the symbol adjustments in sub-blocks in which each symbol is influenced 
only by the symbols in its sub-block, rather than by all the symbols in the block. Such method 
is described, for example, in PCT publication WO98/20617 the disclosure of which is 
incorporated herein by reference. The WO98/20617 publication shows that by beginning the 
recursion calculations from a point sufficiently before the beginning of the sub-block, the 
recursion values corresponding to the symbols in the sub-block are sufficiently valid although 
the recursion did not start from the beginning of the block. Even so, the storage requirements 
for a 16-state code, using 10-bit state metrics for blocks of 40 symbols, is 6400 bits. When 
implementing a hardware decoder such an amount of storage is very expensive. 

Fig. 1A is an exemplary trellis map 10 of the convolutional code 7.5 which is known in 
the art. The encoder of the 7.5 code may be in any of four states s(n) 12 identified by a 
combination of two bits. From each of the states s(n), the encoder can move to one of two other 
states s(n+l) based on an input bit d(n), as indicated by branches 14. In moving from state s(n) 
to state s(n+l) 16, the encoder provides a symbol group formed of a pair of bits co(n) and 
ci(n). The symbol group cg(n) and cj(n) is transmitted to a receiver which uses a decoder to 
determine the original bits d(n). The decoder assigns each received symbol group with a vector 
of branch metric (BM) values 5 n (co,ci) which designate the conditional probability that the 
received symbol group originates from the transmitted symbol group designating cq and cj. In 
addition, the decoder calculates forward (a n ) and backward (p n ) state metric vectors which 
have elements for each of the states s(n) which designate the conditional probability that the 
decoder was in the state s(n) at time n. Generally, forward vectors a n are recursively dependent 
on preceding vectors (a n +i = f(a n , 5 n )), and backward vectors (3 n are recursively dependent 
on following vectors (P n = f(P n -hl > S n )). 
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As can be seen in Fig. 1A, Each pair of states s(n) is connected to a pair of states 
s(n+l) although not necessarily the same states. The pairs of states with the connecting 
branches form a butterfly trellis which is depicted in Fig. IB which is a butterfly trellis diagram 
for convolutional codes. This feature of pairs of states is generally true in trellises of 
5 convolutional codes. In Fig. 1 A states 00 and 01 (which are equivalent to states X and Y in fig. 
IB) at time n are connected to states 00 and 10 (which are equivalent to states Z and W in fig. 
IB) at time n+1. Similarly, states 10 and 1 1 at time n (which are equivalent to states X and Y in 
fig. IB) are connected to states 01 and 1 1 at time n+1 (equivalent to states Z and W in fig. IB). 

SUMMARY OF THE INVENTION 
10 An aspect of some preferred embodiments of the invention relates to reducing the 

amount of memory required for decoding convolutional codes. 

An aspect of some preferred embodiments of the present invention relates to a method 
m of providing pairs of forward and backward vectors concurrently to an LLR unit for A- 
: % Posteriori probability estimation of a decoded block. During a first stage, before the beginning 
111 15 of the operation of the LLR unit, substantially all of the backward vectors are calculated but 
HI only one or more reference vectors, from which the rest of the vectors may be easily re- 
!*J calculated, are stored. During a second stage, the backward vectors which were not stored are 
calculated from the stored reference vectors and are supplied to the LLR unit along with the 
ill forward vectors. 

l Zl 20 In a preferred embodiment of the present invention, during the first stage, reference 

i|| vectors are stored at predetermined intervals. In the second stage the reference vectors are 
retrieved in course from the storage and are used to re-calculate a group of non-stored vectors 
between a current and a previous reference vector. Preferably, a short term storage area is used 
to store the re-calculated vectors. Each group of non-stored vectors are preferably stored in 
25 place of the previously re-calculated vectors, as the previously re-calculated vectors were used 
already and are not needed any more. 

An aspect of some preferred embodiments of the present invention relates to using a 
reverse function in recalculating at least some of the vectors. In some preferred embodiments of 
the invention in which the backward vectors are calculated in the first stage using a reversible 
30 function, the backward vectors are calculated in the second stage using the reverse function of 
the function of the first stage. Thus, both the backward and forward vectors are calculated in 
the same direction of progression over the block during the second stage. In a preferred 
embodiment, the vectors calculated in the first stage are not stored. Alternatively, a plurality of 
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backward vectors from the first stage are stored, in order to limit possible round off errors due 
to the use of the reverse function and/or when singular functions are encountered. Further 
alternatively or additionally, the reverse function is used to re-calculate only some of the non- 
stored vectors and the rest of the recalculated vectors are calculated using the non-reversed 
5 function. 

There is therefore provided in accordance with a preferred embodiment of the present 
invention, a method of calculating recursive state metric vectors of a block of symbols, the state 
metric vectors being supplied to an output calculating unit, including pre-calculating the state 
metric vectors from a first side of the block, and calculating for a second time at least some of 
10 the state metric vectors from the first side of the block. 

Preferably, pre-calculating the state metric vectors includes pre-calculating the state 
metric vectors before supplying substantially any of the vectors to the output calculating unit. 
Preferably, the method includes storing fewer than all the pre-calculated vectors. 
Preferably, calculating for a second time at least some of the state metric vectors 
15 includes calculating those vectors which were not stored. 

Preferably, storing fewer than all the pre-calculated vectors includes storing a number 
of vectors which is about the square root of the number of pre-calculated vectors. Preferably, 
55 storing fewer than all the pre-calculated vectors includes storing vectors selected responsive to 

is: is 

m the locations of singular functions used in the pre-calculating of the vectors. Alternatively or 

20 additionally, storing fewer than all the pre-calculated vectors includes storing vectors selected 

*JTJ in predetermined intervals. Preferably, storing vectors selected in predetermined intervals 

includes storing vectors with equal intervals between them. Alternatively, storing vectors 
selected in predetermined intervals includes storing vectors with intervals of decreasing size 
between them. 

25 Preferably, calculating for a second time the at least some of the state metric vectors 

includes calculating the vectors based on the stored vectors. 

Preferably, calculating for a second time the at least some of the state metric vectors 

includes calculating at least some of the at least some of the vectors using a reverse function of 

a function used in the pre-calculating. Preferably, substantially none of the pre-calculated 
30 vectors are stored during the pre-calculation. Alternatively, substantially only pre-calculated 

vectors from which a next pre-calculated vector is calculated using a non-reversible function, 

are stored during the pre-calculation. 
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Preferably, calculating for the second time includes calculating at least some of the 
vectors using the function which was used in the pre-calculation. Alternatively or additionally, 
the pre-calculating is performed using a function which is an approximation of an original 
function and calculating for a second time the at least some of the state metric vectors includes 
5 calculating at least some of the at least some of the vectors using a reverse function of the 
original function. Further alternatively or additionally, the pre-calculating is performed using a 
function which is an approximation of an original function when the original function is non- 
reversible. 

Preferably, pre-calculating the state metric vectors from a first side of the block includes 
10 calculating forward state metric vectors. Alternatively, pre-calculating the state metric vectors 
from a first side of the block includes calculating backward state metric vectors. Preferably, the 
method includes calculating state metric vectors from a second side of the block after the pre- 
calculating. 

Preferably, pre-calculating the state metric vectors includes pre-calculating a number of 

15 vectors substantially of the size of an encoding block. Alternatively or additionally, pre- 
calculating the state metric vectors includes pre-calculating a number of vectors substantially 
smaller than the size of an encoding block. 

There is further provided in accordance with a preferred embodiment of the present 
invention, a method of calculating recursive state metric vectors from opposite sides of a block 

20 of symbols, the state metric vectors being supplied to an output calculating unit, including pre- 
calculating a plurality of state metric vectors from a first side of the block, storing fewer than 
all the pre-calculated state metric vectors, and calculating state metric vectors from a second 
side of the block after pre-calculating the plurality of state metric vectors from the first side of 
the block. Preferably, the method includes calculating at least one of the state metric vectors 

25 from the first side of the block for a second time after calculating at least one of the state metric 
vectors from the second side of the block. Preferably, calculating at least one of the state metric 
vectors from the first side of the block includes calculating the state metric vectors from the 
stored vectors. Preferably, storing fewer than all the pre-calculated vectors includes storing 
vectors calculated in predetermined intervals. Preferably, calculating the at least one of the state 

30 metric vectors includes calculating based on the closest stored vector. 

Preferably, the method includes presenting an output calculating unit with pairs of 
corresponding state metric vectors from opposite sides of the block. Preferably, the block of 
symbols is divided into a plurality of segments defined by the stored vectors and the method 
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includes calculating for substantially all the segments at least one of the vectors for a second 
time based on the respective stored vectors of the segment. Preferably, calculating for 
substantially all the segments includes calculating for all the segments except the first segment. 
Preferably, the method includes storing the at least one of the vectors which was calculated for 
5 the second time. Preferably, calculating the at least one of the vectors for the second time 
includes calculating substantially all the non-stored vectors of the segment. Preferably, the 
method includes storing substantially all the vectors of the segment which were calculated for 
the second time. Preferably, the method includes calculating at least one of the vectors from the 
first side of the block for at least a third time. 
10 There is further provided in accordance with a preferred embodiment of the present 

invention, a method of calculating recursive state metric vectors of a block of symbols, the state 
metric vectors being supplied to an output calculating unit, including calculating a plurality of 
state metric vectors representing a first side of the block using a first function, and calculating a 
plurality of state metric vectors representing the first side of the block using a second function 



II! 1 5 different from the first function. 
III Preferably, the first function is identical to a third function when the third function is 

:*? reversible and is received by adding a small value to the third function when the third function 

>:- is non-reversible. Preferably, the second function is substantially a reverse of the first function. 

h| Alternatively, the first function includes an approximation of a third function and the 

^ 20 second function includes a substantial reverse of the third function. 
:0 Preferably, the first function includes a log MAP function. 

faf Preferably, calculating the plurality of vectors using the first function includes 

calculating substantially all the vectors representing the first side of the block. 

There is further provided in accordance with a preferred embodiment of the present 

25 invention, a method of calculating recursive state metric vectors of a block of symbols, the state 
metric vectors being supplied to an output calculating unit, including calculating a plurality of 
state metric vectors representing a first side of the block recursively based on immediately 
preceding vectors in the block, and calculating a plurality of state metric vectors representing a 
second side of the block based on immediately preceding vectors in the block. 

30 Preferably, calculating the plurality of vectors representing the first and second sides of 

the block includes calculating pairs of vectors representing first and second sides of the block, 
substantially concurrently. 
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There is further provided in accordance with a preferred embodiment of the present 
invention, a state metric calculation unit, including circuitry for recursively calculating state 
metric vectors from a first side of a block including a predetermined number of symbol groups, 
and a memory for storing state metric vectors which has a maximal storage space for state 
5 metric vectors substantially smaller than that required to store the predetermined number of 
state metric vectors. 

Preferably, the maximal storage space of the memory has room for storing less than 
fifty percent of the predetermined number of state metric vectors. Further preferably, the 
maximal storage space of the memory has room for storing less than twenty percent of the 
1 0 predetermined number of state metric vectors. 

Preferably, the circuitry implements a plurality of different functions for calculating the 
state metric vectors. Preferably, the circuitry implements a pair of functions for calculating the 
state metric vectors which pair include mutual reverse functions. 

Preferably, the memory includes a long term storage area for storing pre-calculated state 
15 metric reference vectors in predetermined intervals and a short term storage area for storing 
state metric vectors between two reference vectors. Preferably, the long term storage area 
serves for storing also state metric vectors from between two reference vectors. 

BRIEF DESCRIPTION OF FIGURES 
The invention will be more clearly understood by reference to the following description 
20 of preferred embodiments thereof in conjunction with the figures, in which same number 
designations are maintained throughout the figures for each element and in which: 

Fig. 1 A is an exemplary trellis map of the convolutional code 7.5 which is known in the 

art; 

Fig. IB is a butterfly trellis diagram for convolutional codes which is known in the art; 
25 Fig. 2 is a schematic illustration of a decoder, in accordance with a preferred 

embodiment of the present invention; 

Figs. 3A and 3B are a flow chart of the actions performed in decoding received data 
symbols, in accordance with a preferred embodiment of the present invention; 

Fig. 4 is a schematic illustration of SM vectors of a decoded sub-block, in accordance 
30 with a preferred embodiment of the present invention; 

Fig. 5 is a schematic illustration of SM vectors of a decoded sub-block, in accordance 
with another preferred embodiment of the present invention; and 
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Fig. 6 is a schematic illustration of SM vectors of a large block, in accordance with a 
preferred embodiment of the present invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Fig. 2 is a schematic illustration of a decoder 20, in accordance with a preferred 
embodiment of the present invention. Decoder 20 preferably receives soft data symbols along 
an input channel 22. A buffer memory 24 accumulates a sufficient number of symbols in order 
to decode a sub-block. The received symbols are grouped into groups of one or more symbols 
according to the encoding scheme of the received data symbols. For each symbol group, an 
LLR unit 30, calculates an output value corresponding to the location of the symbol group in 
the sub-block. The calculation is preferably based on the values of substantially all the symbol 
groups in the sub-block, as is known in the art, and described, for example, in the above 
mentioned publication of Steven S. Pietrobon. In the following description the accumulated 
sub-block is assumed to comprise LI groups of symbols (Rj,...,Rli). In addition, buffer 
memory 24 preferably accumulates L2 additional groups of symbols (Rl1+1>-">Rl1+L2) ^ e 
purpose of which is described hereinbelow. 

As is known in the art, a BM calculation unit 25 calculates branch metrics (BMs) which 
represent the groups of symbols from memory 24 in the calculations. Alternatively, memory 24 
accumulates the BMs which were calculated by BM calculation unit 25. 

A forward calculation unit 26 and a backward calculation unit 28 calculate forward and 
backward state metric (SM) vectors, respectively, based on the accumulated symbols, as is 
known in the art. 

LLR unit 30 uses the SM vectors from units 26 and 28, together with the BMs from unit 
25, to provide the output values for each of the groups of symbols in the sub-block, using any 
method known in the art. The output provided by LLR unit 30 for each group of symbols, may 
comprise hard-data and/or soft-data values, may comprise a decoded value or may comprise an 
intermediate value used in additional decoding steps. 

In order to calculate its output for each symbol group, LLR unit 30 receives the 
respective BM (5 n ) of the bit from memory 24 and the SM vectors (<x n , P n -fl) which 
correspond to the location of the symbol group within the sub-block, from both forward 
calculation unit 26 (a n ) and backward calculation unit 28 (P n +l)- The forward SM vectors 
(a^ are calculated by forward calculation unit 26 recursively from the beginning of the sub- 
block, based on a starting SM vector (oq) and the BMs (8i,...,8 n _i). The starting SM vector 
(cq) is preferably determined either based on the SM vector from a previous sub-block or 




based on a predetermined value from which the decoding begins, e.g., a vector of zeros. 
Likewise, the backward SM vectors (P n ) are calculated by backward calculation unit 28 
recursively from the end of the sub-block based on a last backward SM vector (PLl+l) of the 
sub-block and the BMs (5 n +i,...,5Li). The ending backward SM vector (pLl+l) * s °f a 
predetermined value or is calculated from the additional groups of symbols 
(Rli+i,...,Rli+l2) us i n 8 an Y method known in the art, for example, that described in the 
above mentioned PCT publication WO98/20617. The number of additional groups of symbols 
L2 depends, as described in the above publication, on the size of the encoder and on the desired 
accuracy of the calculations. 

Backward unit 28 preferably calculates the SM vectors in two stages. In a first 
preparation stage, performed before the beginning of the operation of LLR unit 30, backward 
unit 28 calculates the backward SM vectors recursively and stores a plurality of them as 
reference vectors to be used in the second stage. The reference vectors are preferably stored in a 
long term storage 38 of an internal memory 34. In the second stage, a decoding stage, 
backward unit 28 preferably operates in parallel with LLR unit 30 and forward unit 26 and 
calculates the backward SM vector (|3 n ) presently needed by the LLR unit, based on the closest 
stored reference vector of a higher index (the backward vectors (P n ) are calculated from the 
highest index to the lowest index recursively, using methods known in the art). In a preferred 
embodiment of the invention, internal memory 34 includes a short term storage area 36 in 
which the backward SM vectors between two reference vectors are stored between their 
calculation and use during the decoding stage. This is a recalculation of values previously 
calculated and thrown away. 

A controller 32 controls the timing and operation of the units of decoder 20, in 
accordance with the following description. 

Figs. 3A and 3B are a flow chart of the actions performed in decoding received data 
symbols by decoder 20, in accordance with a preferred embodiment of the present invention. 
Memory 24 accumulates (50) symbols belonging to a sub-block of LI symbol groups. In 
addition, memory 24 preferably accumulates (52) L2 additional symbol groups in order to 
calculate the last backward SM vector (pLl+l) °f ^ e sub-block. BM unit 25 preferably creates 
(54) respective representative BMs from the accumulated symbols, as is known in the art. 

Reference is also made to Fig. 4 which is a schematic illustration of the backward SM 
vectors (P n ) of the decoded sub-block, in accordance with a preferred embodiment of the 
present invention. In the preparation stage, backward calculation unit 28 calculates (57) 
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backward SM vectors (p n ), using the representative BMs of the accumulated symbols of the 
sub block. Backward calculation unit 28 first determines (56) the last backward SM vector 
(PLl+l) 100 of the sub block. Preferably, backward unit 28 chooses an arbitrary SM vector as 
the value of Pli+L2> f° r example a zero vector. Using the arbitrary vector and the L2 
additional symbol groups, backward unit 28 calculates recursively the SM vectors PL2+L1 to 
PLl+l- The calculation is performed as is known in the art, for example as described in any of 
the above referenced application. With the calculation of each of the vectors, the previous 
vector is preferably discarded as these vectors do not carry reliable data and are not needed for 
further calculations. Vector Pli+1 on the other hand is considered reliable as it is the result of 
L2 consecutive symbol group calculations and is preferably stored in long term storage 38, for 
later use. 

After backward SM vector Pli+1 is calculated, backward unit 28 preferably continues 
to recursively calculate (58) the SM vectors Pn, Pj+i and periodically stores (62) some of 
the vectors 102 in long term storage 38, for later use. The reference vectors 102 to be stored are 
preferably chosen according to a predetermined scheme. In some preferred embodiments of the 
invention, the reference vectors are distributed throughout the sub block in equal intervals of 
SM vectors 104. In a preferred embodiment, illustrated by blocks 60, 61 and 64, unit 28 stores 
every i'th backward SM vector, i.e., PLl+1-mi (m=0...M=Ll/i) where i is close to the square 
root of LI, e.g., between 6 and 10. In the following description it is assumed for simplicity that 
LI is divisible by i, and the stored backward reference vectors 102 are Pi+i, p2i+l> PLl+l- 

After the last backward vector 102 (Pi+i) to be stored in long term storage 38 is 
calculated, the decoding stage (70) is performed. The decoding stage is preferably formed of a 
plurality of steps (M steps). In each step LLR unit 30 provides output for i groups of symbols. 
Each of the steps, is preferably formed of two phases. In a first phase (66), backward unit 28 
preferably calculates the backward SM vectors (Pmi>.. .>P(m-l)i+l) needed for that step, and 
stores them in short term storage 36. Backward unit 28 preferably retrieves a reference vector 
(Pmi) 102 from long term storage 38, and calculates therefrom, recursively, SM vectors 
(Pmi-l>...>P(m-l)i+l)- ^ n a preferred embodiment, also the retrieved reference vector (p m i) is 
stored in short term storage 36. 

In the second phase (71) of each step, forward unit 26 calculates (68) the forward SM 
vectors (a n ) of that step, and units 26 and 28 provide (72) concurrently pairs of forward and 
backward SM vectors (oc n , P n +i) to LLR unit 30. Preferably, backward unit 28 provides the 
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SM vectors by retrieving them from short term memory 36. LLR unit 30 immediately 
calculates (74) its output responsive to the provided vectors and the relevant BM. 

Decoder 20 then preferably proceeds in decoding a following sub-block (returning to 
block 50), as is known in the art. It is noted that some of the operations in handling the 
5 following sub-block may be performed concurrently with the decoding stage 70 of the current 
sub-block. 

The number of reference vectors stored in long term storage 38 is preferably chosen in a 
manner which minimizes the required size of internal memory 34. Preferably, the minimization 
is performed in a manner which does not increase the required computational resources of 
10 calculation unit 28 above a desired threshold. In a preferred embodiment of the invention, the 
number of reference vectors i stored in long term storage 38, is approximately equal to the 
square root of LI (the size of the decoded sub-block), as using the square root minimizes the 
rj size of memory 34. Thus, the long term storage 38 is of a size which accommodates i vectors 

'%t and the size of short term storage 36 is, as well, one which accommodates i vectors. Thus, the 

Ul 15 total size of memory 34 is one which accommodates 2i SM vectors. 

ill Alternatively, long term storage 38 is larger, in order to reduce the amount of 

calculations required during the first phase (66) of the steps of the decoding stage. Further 
alternatively, the number of vectors 104 between each pair of reference vectors 102 is chosen 

let 

ill in a manner which simplifies the implementation of backward unit 28. In a preferred 
s «20 embodiment of the invention, the distance between consecutive reference vectors 102 is equal 
\Q to 2 n , n being an integer. Thus, the hardware implementation of backward unit 28 is simplified. 

Alternatively to having the same number of SM vectors 104 between the stored 
reference vectors 102, the number of vectors 104 between the stored reference vectors 102 
differ throughout the sub-block. In a preferred embodiment of the present invention, the 
25 number of vectors 104 between the stored reference vectors 102 increases with the indexes n of 
the symbol groups of the sub-block, i.e., the number of vectors 104 decreases with the 
progression of the backward calculation from the end of the sub-block to the beginning. During 
the first step of the decoding stage, long term storage 38 is full with SM vectors required at 
later steps of the decoding stage and therefore short term storage 36 must be used to store all 
30 the SM vectors calculated during the first phase (66) of the first step of the decoding stage. At 
later steps, however, the number of spaces in long term storage 38 which are used decreases 
and the vacant spaces are preferably used for storing some of the vectors calculated during the 
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first phase (66) of later steps of the decoding stage. In this embodiment, the border between 
long term storage 38 and short term storage 36 moves with the steps of the decoding stage (70). 

For example, in a preferred embodiment of the invention, the size LI of the sub-block 
is 100, and vectors Piq, P20> P31> P43> P56> P70> P85> 2 nd Pi 01 ^ stored in long term 
5 storage 38 during the preparation stage (57). In the first phase (66) of the first step of the 
decoding stage, calculation unit 28 calculates eight vectors, i.e., P2, - > P9> which are stored in 
short term storage 36. In the first phase of the second step of the decoding stage, unit 28 
calculates nine vectors, i.e., Pn, ... , Pig, P19. Eight of the vectors, e.g., Pn, ... , Pig, are 
stored in short term storage 36, and the ninth vector, e.g., Pi 9, is stored in long term storage 38 

10 in place of vector Piq which is not needed anymore. In each following step, the number of 
vectors calculated in the first phase 66 increases by one, and the additional vector is stored in 
long term storage 38 in the places of the vectors which are not needed anymore. Thus, the 
number of spaces required for storing vectors in memory 34 is equal to 17, relative to 19 
required spaces for evenly spaced reference vectors. 

15 For some decoding methods, such as for the log-map method, a function f used in 

calculating backward SM vector (p n ) from (p n +l) is a reversible function, such that a reverse 
function f~l may be used to calculate (P n +l) from (p n ). In some preferred embodiments of the 
present invention, the reverse function f~l is used to calculate the backward SM vectors (pn) 
from the reference vectors during the decoding stage (70). 

20 In a preferred embodiment of the present invention, the first backward SM vector of the 

sub-block (P2) is calculated during the preparation stage (57). During the decoding stage, 
backward unit 28 calculates vector (P n +l) from (P n ) using f"l, concurrently with the 
calculation of (ot n ) from (oc n _i) by forward unit 26. Preferably, in this embodiment memory 
34 comprises only long term storage 38, and the first phase (66) of the decoding stage (70), 

25 which is time consuming, is eliminated. 

In a preferred embodiment of the present invention, backward unit 28 does not include 
an internal memory 34. Rather, during the preparation stage (57) backward unit 28 calculates 
the first backward SM vector (P2) and uses the reverse function f"l to calculate all the 
backward SM vectors from P2, during the decoding stage (70). 

30 It is noted, however, that in some cases f"l introduces inaccuracies in the calculations 

due to round off problems of reverse functions. Therefore, as described above, in some 
preferred embodiments, reference vectors are stored in long term storage 38 during the 
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preparation stage (57), and the reverse function f-1 is used only between the reference vectors. 
Preferably, the distance between the reference vectors is set based on a maximal number of 
allowed consecutive reverse-function iterations R. The maximal number of reverse iterations R 
is preferably determined so that the round off error due to the use of the reverse function f~l is 
5 limited below a desired level. 

Fig. 5 is a schematic illustration of the SM vectors (Pn) of the decoded sub-block, in 
accordance with a preferred embodiment of the present invention. In some preferred 
embodiments of the present invention, only some of the backward SM vectors (P n ) are 
calculated using the reverse function f~l during the decoding stage (70). The rest of the 

1 0 backward SM vectors (p n ) are preferably calculated during the first phase (66) of the decoding 
stage (70) using the forward function f, as described above. For example, for each group of SM 
vectors 104, the first R vectors 107, which have lower indexes are calculated using f~l. The 
rest of the vectors, i.e., vectors 109, are preferably calculated from the nearest reference vector 
102 using function f, as described above. Stated otherwise, in each step of the decoding stage 

15 (70) backward unit 28 retrieves a reference vector 102 from long term storage 38. Using 
function f, vectors 109 are calculated and stored in short term storage 36, during the first phase 
(66). Thereafter, the same reference vector 102 is used with the reverse function f"l to 
calculate SM vectors 107. 

In a preferred embodiment, the distance between the reference vectors is determined 

20 using any of the considerations described above and based on the maximal number of allowed 
consecutive reverse iterations R. Designating the number of vectors which may be stored in 
short term storage 36 by SI, the number of vectors between two neighboring reference vectors 
102 is Sl+R. The number of reference vectors is S2=L1/(S1+R) and it is desired to minimize 
the space required in memory 34, i.e., S1+S2. 

25 In some preferred embodiments of the present invention, function f is a non-reversible 

function f\ which is an approximation of a reversible function f2, e.g., due to round off or 
when using the log-max method which is an approximation of the log-map method. In one of 
these embodiments, some or all of the calculations which are performed using f, are performed 
with f=f\ 9 the non-reversible function. Calculations performed with the reverse function f"l 

30 and possibly some of the calculations performed using f, are performed with f2 (f2~*)« 

Following is a showing of the reverse function f"l for exemplary functions f. 
Generating the reverse function equations for any other code will be obvious to those skilled in 
the art. 
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In the 7.5 code described hereinabove with reference to fig. 1 A, using the linear MAP 
method, function f is defined by the equation set: 
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The reverse function f"l is therefore defined by the following equations: 
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In some preferred embodiments of the present invention, when it is determined during 
calculation of a vector P n from P n +i in preparation stage (57) that the branch metrics 8 n form 
a non-reversible (singular) function, one or more of the branch metrics 8 n are slightly changed 
in order to remove the singularity. For example, referring to equations (1), when for a specific 
n, the matrix in either of equations (1) is singular, i.e., 5 n 00=5 n l * or Sn^^n*^ a va l ue I £ f° r 
a small 8 is added to the matrix. That is, a value e is added to 5 n ^0 or 5 n ^ as required. 
Alternatively, a value e is added to 5 n ^l or 5 n ^ 1 as required in order to remove the singularity. 

Alternatively or additionally, when the function for calculating vector P n from vector 
P n +1 is singular, calculating unit 28 stores P n +i such that it is not necessary to use the reverse 
of the singular matrix. Preferably, memory 34 comprises a special storage area (not shown) for 
vectors stored due to singularities. Alternatively or additionally, at least some of the reference 
vectors 102 are chosen based on the singularities. In a preferred embodiment of the present 
invention, calculation unit 28 stores reference vectors at equal intervals unless a singular 
equation is encountered in which case the vector on which the singular function is applied is 
stored and the equal vector intervals are preferably counted from that vector. In another 
preferred embodiment of the invention, the number of reference vectors 102 stored during the 
preparation stage (57) is constant and the intervals between reference vectors 102 when 
singularities are not encountered are adjusted according to the number of singularities 
encountered during the preparation stage. For example, preparation stage (57) may begin with 
storing each tenth vector. If a large number of singularities are encountered the interval 
between reference vectors is enlarged and, for example, every 15th vector is stored. If, 
however, less than an average number of singularities are encountered, the interval between 
stored vectors may be shortened. 

Alternatively to storing entire vectors P n -H to which singular functions are applied, a 
part of these vectors P n + j, including one or more elements, is stored. The one or more stored 
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elements preferably allow calculation of the rest of the elements f3 n +j from P n despite the 
singularity of the function. 

Alternatively or additionally to storing vectors to which singular functions are applied, 
vectors which cannot be calculated using the reverse function f~l due to the singularity, are 
calculated during the first phase (66) of the decoding stage (70) using function f. Preferably, 
for each interval between reference vectors 102, the number of vectors 109 (Fig. 5) calculated 
using function f, and the number of vectors 107 calculated using function f-1, are dynamically 
adjusted according to whether the interval includes singular functions and where the singular 
function appears within the interval. 

For the log MAP method, P n and 8 n have values which are the logarithms of their 
values in the linear case. Using the definitions in Fig. IB, the equations defining f for a pair of 
states x and y, are: 

J% = logCe* 1 + ) and /% = log^+^+l + ) . 

The reverse function f-1 is preferably defined by the equations: 

^ +1 = 1 ° g( c (4!^)_ e (4?^) )and W 

As LLR unit 30 is only interested in the relative values of the elements of backward 
vectors p n , equations (3) and (4) are preferably replaced by: 



/%+\ =log 



/ft 1= log 



+ C and (3") 



+ C (4') 



Preferably, for the log MAP method, functions f and f-1 are implemented partially or 
entirely using one or more look up tables (LUT). In a preferred embodiment of the invention, a 
LUT is used to approximate the function log(l+el x l) and log(l-el x l). Singularities are 
preferably handled using any of the methods described above regarding the linear MAP 
method. 

The number LI of symbol groups in each sub-block used in decoding is preferably 
chosen as a compromise between the required amount of memory space and the decoding 
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speed and accuracy. A conventional number of symbol groups LI used in the prior art is 
approximately 100. Although substantially any size LI of the sub-block may be used, in some 
preferred embodiments of the invention, the size LI of the sub-block is substantially larger 
than that conventionally used in the prior art. Using the methods of the present invention, the 
required size of memory 34 for a given size LI of the sub-block is significantly reduced 
relative to the prior art. Therefore it is possible to increase the size LI of the sub blocks, 
achieving faster and better decoding, without substantially increasing the required size of 
memory 34. In a preferred embodiment of the invention, the decoding is performed on entire 
blocks, thus eliminating the need to accumulate (52) the additional symbol groups and to 
calculate (56) Pl1+1- Thus, the additional time required relative to the prior art for performing 
the first phase (66) of the decoding stage, is partially compensated for. 

It is noted that the size LI of the sub-blocks used in decoding is not necessarily 
constant and LI may change during the decoding. 

The implementation of the present invention does not depend on the number of bits 
included in the state metrics in the forward and backward vectors, and substantially any 
number of bits may be used according to considerations known in the art. It is noted, however, 
that in some preferred embodiments of the present invention, the reduction in storage space 
achieved by the invention is used to increase the number of bits included in each state metric 
and thus increase the accuracy of decoder 20. 

Fig. 6 is a schematic illustration of the backward reference vectors (P n ) of a large 
block, in accordance with a preferred embodiment of the present invention. In some preferred 
embodiments of the invention, the preparation phase 66 comprises a plurality of sub phases, in 
order to further conserve memory space. Preferably, instead of calculating and storing all the 
SM vectors 104 between the reference vectors in one shot, requiring short term storage 36 to 
be of the size required to store all of vectors 104, backward unit 28 calculates and stores the 
vectors 104 in a plurality of sub phases. For example, if the large block comprises a thousand 
symbol groups and the number of vectors 104 between the reference vectors 102 is one 
hundred, during the first sub phase unit 28 may store every tenth vector 106. In a second sub- 
phase of preparation phase 66, unit 28 calculates and stores all the vectors 108 between vectors 
106. Thus, for a large block of a thousand symbols, the required storage space in memory 34 
comprises only room for about thirty vectors: ten vectors 104 in long term storage 38, and ten 
vectors 106 and ten vectors 108 in short term storage 36. 

It is noted that it is not necessary to store entire SM vectors in memory 34. For 
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example, in some embodiments the different states of the SM vectors are related in a manner 
which allows calculating the entire vector p n from half of vector p n and a complementary half 
of vector P n +i- In such embodiments, it is possible to store in long term storage 38 
complementary parts of consecutive vectors rather than entire vectors. 



reference to Fig. 1 A, rather than storing for each reference vector P n all the state metrics in the 
vector, backward calculation unit 28 stores the state metrics P n (01), p n (00), p n +i(01), and 
P n +1 (11). As can be seen from equations (1), p n (10) and p n (ll) can be calculated from 
Pn+l(01), and P n +i (11) and therefore storing the above state metrics is equivalent to storing 

10 vector P n (ignoring the additional calculations). 

In some preferred embodiments of the present invention, instead of storing all the 
elements of the reference vectors, only those elements of the vectors which are considered 
significant are stored, for some or all of the reference vectors. Alternatively or additionally, 
only some of the elements of the SM vectors stored during the decoding stage (in short term 

15 storage 36), are stored. The non-significant elements are preferably elements which have a 
state metric value much lower than other elements in the vector such that the chance of that 
element to be in a decoding trellis is very slight. 

It is noted that although the above description refers to forward decoding which requires 
pre-calculation of the backward SM vectors, the principles of the present invention apply 

20 equivalently to backward decoding in which the forward SM vectors are pre-calculated. 

Furthermore, although decoder 20 was shown as including only a single backward 
calculation unit 28, more than one unit may be used in parallel in order to speed up the 
operation of the decoder. In a preferred embodiment of the invention, unit 28 comprises two 
sub-units which operate in parallel during the decoding stage. Preferably, while a first sub-unit 

25 performs the second phase (71) of a current step, a second sub-unit performs the first phase 
(66) of a next step. Preferably, in this embodiment, memory 34 includes two short term storage 
spaces 36 which are used by the two sub-units interchangeably. 

It is further noted that the principles of the present invention apply substantially to any 
decoding method which implements the Bahl algorithm, and/or approximations of the Bahl 

30 algorithm, including the linear MAP version, the log MAP version, and the approximated log- 
max version. The principles of the present invention may be used with any finite state system, 
such as communication systems, and signal identification systems(e.g., handwriting, voice and 
image recognition). The communication systems may be used in decoding convolutional 
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For example, using as an example, the convolutional code 7.5 described above with 
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codes, trellis coded modulation (TCM) codes, BCH codes, or any other finite state codes. 
Alternatively or additionally, the principles of the present invention may be applied in 
performing channel equalization and/or in implementing an Inter symbol Interference (ISI) 
equalizer. 

It will be appreciated that the above described methods may be varied in many ways, 
including, changing the order of steps, and the exact implementation used. It should also be 
appreciated that the above described description of methods and apparatus are to be interpreted 
as including apparatus for carrying out the methods and methods of using the apparatus. 

The present invention has been described using non-limiting detailed descriptions of 
preferred embodiments thereof that are provided by way of example and are not intended to 
limit the scope of the invention. Variations of embodiments described will occur to persons of 
the art. Furthermore, the terms "comprise," "include," "have" and their conjugates, shall mean, 
when used in the claims, "including but not necessarily limited to." The scope of the invention 
is limited only by the following claims: 
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